Learning Differentiable Programs with Admissible Neural Heuristics (1/24)

We first discussed some possibilities for adding LLM’s to the picture

The current architecture of the neural heuristic is FFN/RNN, is there benefit to making it a language model?

Could more constructs (rather than just if then else) be learned? Could improvement be improved?

While doing so, we talked about common assumptions in LLMs:

Since LLM’s for code are trained on “clean” data, would it be a fair assumption that it doesn’t generate trivial bugs?

We also discussed why we need a neural admissible relaxation instead of using some purely symbolic heuristic-based relaxation:

The neural relaxation is for learning a program that fits the input-output examples. However, maybe some symbolic-based heuristics could be considered for pruning branches.

Finally, we talked about the difficulties of supporting loops in the DSL.

Someone brought up a Martin Vechev paper that supports loops ← [edit: shashank] The vechev paper that i was referring to was quite different from what we were discussing. They essentially attempt to define NN primitives like tanh etc. in terms of abstract domains. It came to mind because they attempt to translate and describe these concepts of tanh etc. in another algebra: of abstraction domains. We’re interested here translating concepts on program primitives like loops etc. another algebra: linear algebra.
Paper i was thinking about: https://ggndpsngh.github.io/files/DeepPoly.pdf . See Section 4.
What i rather had in mind was TerpreT, which does define loops as differentiable objects.

We mentioned TerpreT and Stitch (to support loops soon?)
- Other related work: https://arxiv.org/abs/1611.02109
- Eric Zhang’s excellent notes on Nada’s course on PL+AI. This has notes on TerpreT and a bunch more: https://www.ekzhang.com/assets/pdf/CS_252r_Notes.pdf